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We introduce a new framework that yields spectral bounds on norms of functions of tran- 
sition maps for finite, homogeneous Markov chains. The techniques employed work for 
bounded semigroups, in particular for classical as well as for quantum Markov chains and 
they do not require additional assumptions like detailed balance, irreducibility or aperiod- 
icity. We use the method in order to derive convergence bounds that improve significantly 
upon known spectral bounds. The core technical observation is that power-boundedness of 
transition maps of Markov chains enables a Wiener algebra functional calculus in order to 
upper bound any norm of any holomorphic function of the transition map. Finally, we dis- 
cuss how general detailed balance conditions for quantum Markov processes lead to spectral 
convergence bounds. 
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I. 



INTRODUCTION 



Across scientific disciplines, Markov chains are ubiquitous in algorithms as well as in 
models for time evolutions. In many cases one is interested in when their limit behavior is 
setting in. For algorithms this is often necessary in order to extract the right information 
and for time evolutions of physical systems this is the time scale on which relaxation 
or equilibration takes place. Some of the most widespread tools for bounding this time 
scale are based on the spectrum of the transition map. For time-homogeneous Markov 
chains with finite state space, the transition map is a stochastic matrix in the context of 
classical probability distributions and a completely positive trace-preserving map in the 
quantum case. Since these maps have spectral radius equal to 1, it is somehow clear that 
only eigenvalues of magnitude 1 survive the limit, that the largest subdominant eigenvalue 
governs the speed of convergence, and that the rest of the spectrum only matters on shorter 
time scales. Let T and Too be the transition map and its asymptotic part, respectively. 
We seek convergence estimates of the form 



after n time steps, where fi is the magnitude of the largest eigenvalue of T inside the 
open unit disc and K depends on the spectrum of T, on n and on the dimension of the 
underlying space. We demand that the dependence of K on n is not exponential, capturing 
the intuition that the convergence is determined by an exponential decay as fi n at larger 
timescales, while for smaller n the whole spectral data is relevant. Such bounds are of 
general interest for the theory of Markov chains, and they are especially important for 
stochastic algorithms, which are widely used in statistics and computer science. They are 
related to the sensitivity of the chain to perturbations [9, 10, 25], are used to study "cut-off" 
phenomena [2] and random walks on groups [19]. 

Before describing our main innovation, we mention two traditional, linear algebraic, 
approaches to bounding convergence times of classical Markov chains as in (1). A Jordan 
decomposition of the difference T — Too yields a bound of the form Equation (1) with 
K = k n~ d > 1+1 n d ^~ 1 , where is the size of the largest Jordan block corresponding to any 
eigenvalue of magnitude [i and k is constant with respect to n but depends on T as it is 
essentially the condition number of the similarity transformation to Jordan normal form. 
Unfortunately, there is no a priori bound on this condition number. An alternative way is 
to use Schur's instead of Jordan's normal form. This leads indeed to an expression as in 
Equation (1) where K can be bounded independent of T, albeit not of n, and we obtain 
roughly K ~ fi~ D+1 (Dn) , where D is the dimension of the underlying vector space. (See 
Section III B for details.) Needless to say, this "constant" seems to be far from optimal, 
especially it does not capture the (correct) asymptotic n-dependence of the Jordan bound. 



\\T n -TZ\\<K^ 



n 



(1) 
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When proving bounds of the form of Equation (1), one typically employs additional 
properties of the Markov chain such as detailed balance, irreducibility, aperiodicity, unique- 
ness of the fixed point, Gibbs distribution of the stationary state, etc.. Clearly, these as- 
sumptions are not always fulfilled — in particular in the quantum context detailed balance 
seems to be a less natural assumption and, furthermore, especially in the area of dissipative 
quantum computing [28] and dissipative state preparation [3, 6, 28], one aims at preparing 
rank deficient states. 

For classical Markov chains convergence estimates have been widely studied [7, 22] and 
estimates based on the Jordan and Schur decompositions have been known for many years. 
Although the latter are generally referred to as spectral convergence bounds, they do not 
provide a satisfactory spectral desciption of the convergence of a Markov chain. While in 
case of the Jordan bound it is not possible to compute K in terms of the eigenvalues of T, 
the Schur bound cannot provide the correct asympotic behavior and does not reflect the 
full spectral structure of T . So far there is no a priori estimate as in Inequality (1) such 
that K can simply be inferred from the localization of the eigenvalues of T and such that 
one obtains the correct asymptotic behavior of the chain. One goal of the present work is 
to close this gap and to understand what information the spectrum of the transition map 
of a classical or quantum Markov chain carries about the speed at which it approaches its 
stationary behavior, i.e., to determine K in terms of the spectrum of T ■ 

Our primary interest lies in the study of classical and quantum Markovian evolutions. 
However, to obtain a unified picture, in this article we will state our results more generally 
for bounded semigroups of linear maps. Any endomorphism T of a vector space V naturally 
generates a semigroup consisting of all n-fold concatenations T n , n £ N. In our analysis 
we shall assume that the vector space of endomorphisms of V carries a norm and that the 
map T is power-bounded with respect to that norm. That means there is a constant C such 
that, for any n, \\T n \\ is bounded by C. This is equivalent to saying that the semigroup 
(T n ) n >o generated by T is bounded. The framework of bounded semigroups naturally 
incorporates both classical and quantum Markov chains (see Section II B). 

We start our discussion by analyzing the asymptotic behavior of a bounded semigroup 
(T n )n>o- We discuss spectrum related properties of T that generate a bounded semigroup 
and define the asymptotic part of the evolution Too in Section III A. In Section III B we 
extend the known convergence estimates based on the Jordan and Schur decompositions 
to cope with bounded semigroups. Implicitly, the analysis covers quantum Markov pro- 
cesses, where we state new convergence estimates. Section IV contains our main result, a 
convergence estimate with the form of Equation (1), where K is fully determined by n and 
the spectrum of T ■ We start Section IV with a mathematical primer, Section IV A, con- 
taining an introduction to an entirely new mathematical toolbox in the context of Markov 
processes. We proceed by analyzing what information can be inferred from the spectrum 
of T about the speed at which (T n ) n >o approaches its asymptotic behavior, Section IV B. 
The methods, which we employ, enable us in principle to derive spectral bounds on norms 
of arbitrary functions of transition maps. When applied to power functions, we basically 
obtain the sought convergence bounds. We discuss in Subsection IV C how our new bound 
outperforms the convergence estimates based on the Jordan and Schur decompositions. 

Nevertheless, it turns out that for many application, such as dissipative quantum com- 
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putation and state preparation, the convergence estimates obtained still are insufficient to 
prove the efficiency of a possible implementation. The problem is that the convergence 
time grows with D, which in turn is exponential in the number of constituent particles 
(Section IV C). We discuss aspects related to the optimality of our new estimate as well as 
the convergence speed of contractive Hilbert space semigroups in Section IV D. We prove 
that stronger estimates, i.e. estimates such that roughly log (D) time steps bring the chain 
close to stationarity cannot rely on the spectrum of the transition map alone, the latter 
simply does not contain sufficient information. 

As an approach to better convergence estimates in Section V we extend the detailed 
balance condition for classical Markov chains and define this property in the context of 
bounded semigroups, which then includes quantum evolutions. The core theorem of this 
section is an extension of a convergence estimate that is frequently used to prove cut-off 
behavior for classical Markov chains (Section VB). 

Our discussion focuses on general bounded semigroups but the corresponding statements 
about classical and quantum Markov processes are implicit, and we will frequently use 
these for illustration. In what follows one can think of T either as a quantum channel or 
an ordinary stochastic matrix. 

II. PRELIMINARIES 
A. Bounded Semigroups 

Throughout this paper V will be a real or complex vector space of finite dimension D. 
The set of linear endomorphisms of V will be denoted by C(V), which shall be endowed 
with a norm ||-|| . For a given T 6 £(V) we consider the semigroup (T n ) n >o = \J~ n \ n £ N} 
of linear maps on V generated by T ■ Throughout, we assume that (T n ) n >o is bounded, 
i.e., there is a constant C > such that sup n>0 ||T n || < C. 

Our main approach applies for (T n ) n >o with a general norm. Nevertheless, for certain 
results concerning convergence of classical and quantum Markov chains it will be convenient 
to endow V with a scalar product (-|-). We will consider the induced Hilbert space norm 
(shortly, 2-norm) \\v\\ 2 = \J {v\v) and the operator norm (shortly, oo-norm) on C(V) defined 

by |7loo = SU P«^0 ^Tdl^ 2 ' ^ n some °f our exam pl es ( e -g-, classical Markov chains) it is 
useful to fix an orthonormal basis {ej}j=i £> for V. In this case we write T for the matrix 
representation of T with respect to {ej}j, i.e., T^j = (ej|7~(ej)}. We will emphasize whether 
or not V has such additional structure in the corresponding sections. 

B. Classical and Quantum Markov chains 

We briefly review the definitions of classical and quantum Markov chains and discuss 
certain related concepts. 

A classical, finite and time homogeneous Markov process is characterized by a semigroup 
generated by a classical stochastic matrix T. More precisely, in this scenario V = R D 
equipped with the canonical basis {e{\i and standard scalar product. The assertion that 
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T is stochastic is equivalent to Tij > and J2i Tij = 1 • The latter is equivalent to saying 
that the vector e = Yli=i e i £ V is fixed by the adjoint map, T*(e) = e. In the context of 
classical Markov chains the 1-norm plays an exceptional role. For v G V we write Vi = {ei\v) 
and define \\v\i = YliLi \ v i\- The induced norm on the set of matrices M acting on V is 
called the 1-to-l norm, 

v^o Fill 

The 1-to-l norm and the oo-norm (i.e. the 2-to-2 norm) are equivalent with 

D- 1/2 IMIU < IMH^ < \M\„ . (2) 

It is easily seen that ||T'|| 1 _ ) . 1 = 1 for any stochastic matrix T. We note that if ||-|| is any 
norm such that \T\ < C holds for all stochastic matrices T, then ||T n || < C Vn G N; that is 
the Markov chain constitutes a bounded semigroup with constant C. Since we are working 
in finite dimensions, such a semigroup is bounded with respect to any norm. 

A time homogeneous quantum Markov chain is also characterized by a semigroup. In the 
context of quantum evolutions, however, the space V has different and additional structure. 
In this article we think of V as the real vector space consisting of hermitian matrices acting 
on a complex Hilbert space of dimension d, i.e., D = d 2 . A matrix p G V that is positive 
semidefinite (p > 0) and has unit trace (tr[p] = 1) is referred to as a quantum state. An 
element T G -C(V) is called positive iff X > implies T(X) > for any X G V, and 
trace-preserving iff tr[7~(A)] = tr[A] VA G V. T is trace-preserving iff the adjoint T* of 
T with respect to the Hilbert-Schmidt inner product {X\Y) = tr(AY) on V preserves the 
identity matrix, T*(l) = 1. If T<S>X, with I being the operator identity, acts as a positive 
map on V <g> V, then T is called completely positive [12, 16]. We denote by 1 the subset of 
C(V) containing trace preserving and positive maps (TPPMs) and by T + C T the set of 
completely positive maps in T (TPCPMs). The latter describe the dynamics of a quantum 
system, whenever the evolution of the system is independent of its history, and they are 
called quantum channels in the realm of quantum information theory. 

For X G V we denote by \X\ X the Schatten 1-norm of X. The induced distance 
\\p — of two quantum states p and a corresponds to the maximum probability to detect 
a difference between p and a in an experiment, i.e. 

\\p-a\\ 1 = sup |tr(0(p- cr))|, 
l|OIU<i 

where WOj^ stands for the largest singular value of O G V. For linear maps M. G C(V) we 
define 1 the induced 1-to- 1-norm via 

,„ n \M(X)U 

l-s-l = sup — — . 

X=i0 II A 111 



1 Note that in this article we define the 1-to-l norm with the supremum taken over Hermitian matrices. 
Alternatively, the supremum could be taken over all matrices. The resulting norms are different, but 
the latter can be upper bounded in terms of 2 times the former. 
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The diamond norm is the "stabilized version" of the 1-to-l norm, 

IMIoHMaxi^, 

where X denotes the operator identity in C(V). It is the dual of the norm of complete 
boundedness (CB-norm), i.e., we have l-Mf^ = The diamond norm and the 

1-to-l norm are equivalent with [23] 

||M|| 1 ^ 1 <||M|| <> < J D 1/2 ||^|| 1 _ 1 . (3) 

For any quantum channel T we have ||T||i->i = ||T||o = 1 [16]. The distance \\£ — T\ of 
two channels £ , T measures how well these channels can be distinguished by any quantum 
experiment. In the quantum context the 1-to-l norm and the oo-norm (i.e., the 2-to-2 
norm) are equivalent with 

^~ 1/4 IITIloo < IITIIl^l < IITIloo ■ (4) 

Note that due to the different structure of the underlying space V in case of quantum 
Markov chains, the above differs from the Inequalities (2). 

If we are given a norm such that \\T\\ < C VT £ X (or VT £ X+) the quantum Markov 
chain generated by T constitutes a semigroup bounded by C. Again, due to D < oo, this 
implies that the semigroup is bounded with respect to any norm. 



C. Spectral properties 

To each linear map M. 6 C(V) we can assign a spectrum a(M) via the usual eigenvalue 
equation: we have A G a(M) if and only if there is X / with M(X) = XX. We 
write m_A4 for the minimal polynomial associated with Ai (i.e., the minimal degree, monic 
polynomial that annihilates A4, mj v \{M) = 0) and |mjn| f° r the number of linear factors 
in rriM- Another important object is the Blaschke product associated with m_/vj, 

B <*> = II (5) 

where the product is taken over all i such that the linear factor z — \ occurs in rriM respect- 
ing multiplicities. Thus, the numerator of B as defined here is exactly the corresponding 
minimal polynomial, uim- 

For convenience, we shall always assume that the eigenvalues in a(M) are arranged 
such that their magnitudes are non-decreasing. (This ordering is not unique when several 
eigenvalues have the same magnitude. This ambiguity will, however, be irrelevant in the 
following. Whenever the situation occurs that we pick an eigenvalue of a certain magnitude 
|A| we mean that we can take any eigenvalue that has this property.) 

For any M G C(V) the largest magnitude of all eigenvalues is the spectral radius, 

which we denote as [i. It follows from Gelfand's formula \i = lim^oo ||A4 || [4] that 
the spectral radius of any element of a bounded semigroup is at most 1. For stochastic 
matrices and TPPM it is clear that 1 is an eigenvalue of T. 
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III. LIMITING BEHAVIOR AND CLASSICAL CONVERGENCE ESTIMATES 

A. Limiting behavior of (T n ) n >o 

In this section we begin our discussion of spectral convergence bounds for semigroups. 
Based on a spectral decomposition of T G (T n ) n >o we introduce a map Too and show that 
this map reflects the behavior of (T n ) n >o for large n. In the following we extend known 
spectral convergence bounds for classical Markov chains to the more general semigroup 
setup. We consider the classical derivations based on the Jordan and Schur decomposition 
(Section III B). For this reason in this section we assume that V carries a scalar product. 

Our main result Theorem IV. 3 will later outperform the bounds proven in this section 
in terms of convergence speed even in the context of classical Markov chains. Moreover, 
the techniques introduced there will allow us to consider general norms, which are not 
induced by a scalar product. 

To formalize our intuition that the spectrum of T determines the convergence properties 
of (T n ) n >o let us consider a Jordan decomposition of T, 

T = ^(A^+M), with (6) 

i 

MiVi = ViMi = Mi, V{Pj = 5 i:j Vi Vi, j. (7) 

Here, the summation is taken over all distinct eigenvalues of T, the Vi are projectors whose 
rank equals the algebraic multiplicity of Aj and the Mi are the corresponding nilpotent 
blocks. All contributions to T n that stem from eigenvalues of T with magnitude smaller 
than 1 will vanish with increasing n. Hence, we expect the image of T n to converge to a 
subspace of V spanned by all eigenvectors of T whose eigenvalues are of magnitude one. 
We therefore define the linear map Too whose range is this subspace by 

Too := Yl XiV ^ ( 8 ) 

where the Vi are spectral projectors corresponding to the eigenvalues of T of magnitude 1. 
In cases where the spectral radius of T is strictly smaller than 1, To is simply zero. If T has 
only one eigenvalue of magnitude one and this eigenvalue is equals 1, then the sequence T n 
converges to 7^o, which is the unique rank one projection onto the stationary eigenspace of 
T ■ In the following lemma we shall prove that Too mirrors the limit behavior of (T n ) n >o 
also in the more general case. More precisely, as n is increasing \T n — 7^|| approaches 
and, for every k G N, Tx> indeed is an accumulation point of (T n ) n >o- The latter 
assertion is relevant especially in the case of classical and quantum Markov chains: the set 
of stochastic matrices (or quantum channels) constitutes a closed set in the corresponding 
space, which implies that To is again a bona fide stochastic matrix (or quantum channel). 

Lemma III.l (Limiting behavior of T n ). Let (T n ) n >o be a semigroup within C(V) such 
that \\T n \\ < C Vn G N and let To be as in Equation (8). Then we have that 
i) all eigenvalues ofT with magnitude 1 have trivial Jordan blocks (i.e., |Aj| = 1 =4> Mi = 0), 
n) (T n - 7Z) = (T - Too)™ V n G N\{0} ; 
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iii) lim n ^oo \\T n - T£\\ = 0, 

iv) for any k £ N, T^ is contained in the closure of (T n ) n >o * n £(V)> 



Proof, i) We proceed by contradiction and consider IT™^. Since (T n ) n >o is bounded and 
in finite dimensions all norms are equivalent there is < K\ < oo with || T - "" || < K\. On 



the other hand there is K 2 > with fT™!^ > K 2 E 7 ( A i^i + Nj) 



If A,- has a non- 



trivialJordan block the latter can be lower bounded by ^A^jVj + Nj) n > |Aj| n n. It 

oo 

follows that if | Aj | = 1 and Aj has a non-trivial Jordan block then || T n || ^ grows unboundedly 
with n? 

ii) follows from the relations in (7) since T%o = ToT = T^. For n > 2 the statement 
follows by induction. 

iii) By the previous assertion ||T n — T^J = ||(T — T 00 ) n | holds. The spectral radius [i 
of the map T — Too is strictly smaller than 1. We have from Gelfand's formula that 
linin-s.oo \\T n — T^J 1 /" - = (i < 1 and hence for all n sufficiently large \\T n — T^\\ < (j-^j ■ 
With increasing n the right hand side goes to and the claim follows. 

iv) We prove that for fixed k there is a subsequence (T n ')i that converges to 7^,, that 
is lim^oo \\T ni —7^1 = 0. To achieve this we subdivide V into the invariant subspace 
of T corresponding to all eigenvalues of magnitude 1 and its complement. On the latter 
subspace we can directly invoke iii). On the former subspace, it is sufficient to find, for 
any e > 0, a subsequence (T ni )i with the property that — A^| < e simultaneously for 
all i with |Aj| = 1. The existence of such a subsequence follows from Dirichlet's Theorem 
on simultaneous Diophantine approximation, [21] Theorem IB. 

v) By iv) for any k £ N and any e > there is n such that ||T ra — 7^J < e. This implies 
that ||7^|| < C + e Ve > and hence the claimed inequality. □ 



B. Jordan and Schur convergence estimates 



Our next aim is to understand qualitatively by how much for certain n the evolution 
7~™ differs from its limit behavior, i.e., how small the quantity \\T n — 7^1^ = \T n — T^\ 
is for any bounded semigroup. We shortly review two standard methods to obtain such 
estimates. Both methods rely on the fact that 7~ n — 7^ = (T — Too) n an d perform a 
transformation of T—T^ to upper triangular form. While the first approach is to choose the 
Jordan normal form for T — T^, the second one is based on the Schur decomposition. Both 
decompositions involve a similarity transformation A that brings T—T^ to upper triangular 
form, i.e., T — = A(A + AQyl -1 with diagonal A and nilpotent N. While in case of 
Jordan decomposition A + N has Jordan block structure, for the Schur decomposition A 
is unitary. 

Theorem III. 2. Let (T n ) n >o be a bounded semigroup in C(V), let Too be the map intro- 
duced in (8) and let fj, be the spectral radius of T— Too- Then there are constants C±,C 2 > 



2 See also the derivation of the lower bound in Theorem III. 2. 



9 



such that, for all n> 1, 

where is the size of the largest Jordan block corresponding to any eigenvalue ofT— Too 
of magnitude fi. 

Proof. We first state an upper bound on ||(A + AT)™^ with diagonal A and nilpotent 
upper-triangular N. We note that any monomial in N and A vanishes if the total degree 
of A~ is larger than or equal to D — 1. Using this together with the triangle inequality in 
the binomial expansion and exploiting the sub-multiplicativity of the oo-norm we find 

min{n,D— 1} . . 

IKA + ATI^ £ QimtlAf^. (9) 
fc=0 ^ ' 

Let now J(Aj) be a Jordan block with diagonal part \\ and nilpotent part Aj. We 
consider the Jordan decomposition T — = A v J u (\i)j A -1 , where the summation 
goes over i, which labels the different eigenvalues of T — Too, an d over v, which enumerates 
the Jordan blocks corresponding to an eigenvalue Aj. We introduce the constant k = 
inf (l-Afoo ll^ ^loo) where the infimum is taken over all A that bring T — Too to Jordan 
form. It follows readily that 

«- 1 |^loo<|r n -^loo<"l^loo ( 10 ) 

with J = ®j j, </i/(Aj). For any J u (\i) there is an no such that for all n > uq one has 
l^(Aj) \ 00 5; I Jmax (Ajnaa; ) n || 00 , where J m ax(^max) denotes the largest Jordan block corre- 
sponding to an eigenvalue \ ma x of modulus fi. Therefore, to find an upper bound on the 
right hand side of (10) we can subdivide J m ax(^max) m a nilpotent and a diagonal part 
and use Inequality (9). We note that for k < — 1 we can bound (£) < n d ^~ x and taking 
everything together we obtain for large enough n 

\\ Tn - T ^ioo < k Yl ^ _ v n_dM+1 , 

fc=0 

which proves the upper bound in Theorem III. 2 since it can be extended to an upper bound 
valid for any n G N by a rescaling of C\ . The lower bound is a consequence of the following 
inequalities for n > d^ — 1 

n-dn+l( n \ < II t /\ nt* II <- II in 
P \ j 1 / — II °rnax y^max ) Hoc — \\ u 

□ 



^ -y J — 1 II " /ftuu; \' •itiwj; / WOO — II II OO ' 



One problem with the above proof is that no and thus C2 can get large if there is a 
sub-dominant eigenvalue close to the spectral radius. Another issue is that one cannot a 
priori bound k for general T. Consequently, only little is known about C\ and C2. Most 



10 



awkward, C\ and C2 depend on the given channel T, i.e. are not universal for all channels 
of a given dimension. For this reason Theorem III. 2 is a qualitative statement about the 
asymptotic behavior of the semigroup. In contrast, the Schur decomposition allows us to 
state an upper bound on the rate of convergence that only depends on n, D and fi. This 
goes at the price of a rather pessimistic estimate. 

Theorem III. 3. Let (T n ) n >o be a bounded semigroup in £(V) such that ||T n ||oo < C Vn € 
N and let fi be the spectral radius ofT — Too- For any n G N it holds that 

\\T n - TZl^ < 2 f i n - D+1 n D ~ 1 (fi + 2C) D -\ 

Proof. As already mentioned, this will be proven based on the Schur decomposition T — 
Too = U(A + N)W , where U is unitary. As before we can rely on the binomial expansion 
Inequality (9). We note that WUf^ = 1 and that for n > 1 

D-i , x D-l 

E u) £ E »* £ 2 " D_1 - 

k=0 V J fc=0 

Thus, using the sub multiplicativity of the oo-norm it follows from (9) that 

|T"-2£loo < 2n^-V n - D+1 max(l,||Ar|^- 1 ). 
In addition we have that N = T — - A and therefore fATf^ < 2C + \x. □ 

To obtain a convergence estimate for Markov chains in 1-to-l norm we can rely on the 
Inequalities (2). The corresponding statement of Theorem III. 2 is immediate. Analogously, 
Theorem III. 2 can be used to estimate the speed of convergence of TPPMs in 1-to-l and 
diamond norm via the Inequalities (4), (3). 

Due to the lower bound in (2) the singular values of stochastic matrices are bounded 
by D 1 ! 2 from which we infer that C < D 1 / 2 in this case. For positive, trace preserving 
maps the singular values are bounded by D 1 / 4 ([17], or by the norm equivalence (4) and 
the fact that ||7 _ || 1 _ >1 = 1). Thus, Theorem III. 3 includes a convergence bound for both 
classical stochastic matrices and TPPMs. For a more detailed discussion of the resulting 
estimates in the quantum context see Subsection IV C. 



IV. MAIN RESULT: SPECTRUM AND CONVERGENCE 

The main contribution of this article is to introduce a new formalism that yields spectral 
bounds on norm of functions of transition maps of Markov processes and to apply this 
formalism to prove new estimates for the convergence of a such processes to stationarity. 
The core technical innovation will be to employ a Wiener algebra functional calculus in 
the context of bounded semigroups. To prove our estimates we will rely on the theory of 
function algebras, functional calculi and model spaces. To our knowledge these concepts 
have not found their way into the theory of classical or quantum Markov processes so far. 
For this reason at first we briefly introduce the mathematical framework in Subsection IV A. 
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A detailed introduction to the mathematics involved goes beyond the scope of this article 
and we refer to [13-15] for this. In Subsection IV B we employ the mathematical machinery 
to the context of bounded semigroups and derive the main theorem. In the subsequent 
subsection we discuss our main result and compare it to the convergence estimates from 
Jordan and Schur decompositions. 



A. Function spaces and functional calculi 

In this subsection we discuss the problem of bounding the norm of an operator acting on 
a Hilbert or Banach space in general. In the purely mathematical literature this problem 
is studied extensively [13, 15] and in this exposition we follow the theory developed in 
[14]. The main idea is to consider certain spaces of functions and to relate the operator 
under consideration with a certain function. The norm of the operator is then bounded by 
bounding the norm of its "function representative". 

We begin by defining the function spaces, which will be relevant in our discussion. The 
space of analytic functions on the open unit disc D = {z G C||z| < 1} is denoted by 
Hol(S}). We will be concerned with certain subspaces of Hol(0), an important class of 
which constitute the Hardy spaces. For p > those are defined as 

H? := {/ G Hol(B)\ \\f\\ p HP := sup — / |/(re*) W < 

0<r<l -s7T J 

and 

:={f€Hol(B)\UlH- :=sup|/(z)| <oo}. 

It is immediate from the definition that the spaces H p are vector spaces, that the mapping 
/ i-)- Wflfjp is a norm for p > 1 and that H p C H q for p> q. In the special case p = 2 the 
Hardy norm can be written using the Taylor coefficients of the analytic function /. More 
precisely, we write f(z) = J2k>o f(k) zk an d use Parseval's identity to conclude that 

r>27r 



^ 7r |/(re^)| 2 d^ = ^|/>)| 2 . 



o<r<i 2vr Jo 

Thus, / G Hol(B) is in H 2 if and only if Y, k >o \f( k )? < 00 ( see [ 15 ]> P- 32 )- The Wiener 
algebra is defined as the subset of Hoi (3) of absolutely convergent Taylor series, 



W:= {/ = £/>)^|£|/>)|<oo}. 



fc>0 fc>0 

For a given class of operators (for instance Hilbert space contractions or power bounded 
operators) the associated function algebra is a space of analytic functions that mirrors the 
"boundedness properties" of those operators. A functional calculus is a map that associates 
operators from the given class and elements of the function algebra and relates the norms 
of an operator and its representative in the function algebra. More precisely we have the 
following definitions [14]: 
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Definition IV. 1 (Function algebra). A unital Banach algebra A with elements in Hoi (3) 
will be called a function algebra, if 



1. A contains all polynomials and lim n _> 

2. (a € A, A € B, a(A) = 0)^^eA 



,,1/n 



Definition IV. 2 (Functional calculus). Let X : B ^ B be an operator on a Banach space 
B. A bounded algebra homomorphism from a function algebra A into the set of linear 
operators on B, 

J x : A ->■ L(B), 

will be called a functional calculus for X, if it satisfies Jx{z) = X and Jx(l) = 1. 

(In our case it is sufficient to assume that B has finite dimension.) Intuitively Jx 
captures the notion of "plugging an operator into a function", that is for a £ A we have 
a(X) = Jx{o) and by the boundedness property there is a constant Cx such that 

\\a(X)\\ < C x \a\ A . 

Given a family V of operators we say that this family obeys a functional calculus with 
constant C if each X £T admits a functional calculus with Cx < C. Thus, one approach to 
the problem of bounding the norm ||a(X)|| for X £ T is by constructing a functional calculus 
for the family T and then bounding the norm of a in the function algebra. For us, two 
instances of functional calculi will be important. In the first example we consider power- 
bounded Banach spaces operators, while the second one treats Hilbert space contractions, 
i) Consider a family T = {X £ L(B)\ \\X n \\ < C Vn £ N} of Banach space operators that 
are power bounded by some constant C. This family admits a Wiener algebra functional 
calculus since for any f £ W and X £ T 



ll/POII = 



k>0 



fc>0 



WW 



(11) 



k>0 



holds. 

ii) In Section IV D we discuss the semigroup of Hilbert space contractions T = {X £ 
L(H) \ \X\ < 1}. This family allows for an H°° functional calculus (with constant C = 1), 
since by von Neumann's inequality [16] we have for any / £ H°° and X £ F 



ll/POIL < in 



11° 



(12) 



At first glance, the outlined procedure seems to be of little use since the right hand sides 
of (11), (12) do not depend on X anymore. To obtain a better bound one can rely on the 
following insight. Recall that the minimal polynomial mx annihilates the corresponding 
operator, i.e., mx(X) = 0. Instead of considering the function a directly, we add multiples 
of m = mx (or any other annihilating polynomial) to this function and consider c = 
a + mb, b £ A instead of a. It is immediate that ||a(X)|| = ||c(X)||. The following simple 
but crucial lemma summarizes this point: 
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Lemma IV. 1 ([14], Lemma 3.1). Let m / be a polynomial and let T be a set of operators 
that obey an A functional calculus with constant C and that satisfy m(X) = MX 6 T. 
Then 

\\a(X)\\<C\\a\\ A/mA , vxer, 
where \\a\\ A / mA = inf {\\c\\a\ c = f + mb, b e A}. 

Proof For any b £ A we have that \\a(X)\\ = ||(a + mb)(X)\\ < C \\a + mb\\ A . □ 

B. Spectral bounds for the convergence of Markovian processes to stationarity 

Crucial for the main result Theorem IV. 3 is that classical stochastic matrices and 
quantum channels both obey a power-boundedness condition. Given any norm ||-|| such 
that every T 6 5 satisfies ||T|| < C, then for all n > 0, ||T" n || < C, i.e., T generates a 
bounded semigroup (T n ) n >o- I n view of Lemma IV. 1 this entails that (T n ) n >o obeys a 
Wiener algebra functional calculus with ||/(T)|| < C|/lvr/mW Although this observation 
is simple, we state it in a separate theorem to emphasize its importance. 



Theorem IV. 2. Let (T n ) n >o be a semigroup bounded with constant C and let m be the 
minimal polynomial ofT, m(T) = 0. Then 

\\f{T)\\<C\\f\\ w/mW 

holds for any function f G W . 

Theorem IV. 2 can be used to bound various functions of transition maps of Markovian 
evolutions. For instance one might be interested in bounding the norm of the inverse 
of a transition map (if it exists). In [14] an estimate of X^ 1 is derived for an algebraic 
Banach space operator X by using Lemma IV. 1 and bounding H- 2-1 !^^^- This estimate 
immediately carries over to Markov transition maps. In this article we seek bounds for the 
rate of convergence of a semigroup; we will use Theorem IV. 2 to relate this problem to the 
one of bounding || z n \ yy/ m yy ■ In this way we obtain the following: 

Theorem IV. 3. Let (T n ) n >o be a semigroup bounded by C , and let Too be its asymptotic 
evolution introduced in (8). We write m = ^T-Too f or the minimal polynomial and \i for 
the spectral radius of T — Too and B for the Blaschke product (5) associated with m. Then, 
for n > we have 

n(l- (1 + ^) |*|=Mi+i/n) 

Before we proceed to the proof of Theorem IV. 3 let us discuss some immediate conse- 
quences. First, note that the condition n > fi/(l — fx) does not significantly restrict the 
range of n, where the theorem applies. For n < /x/(l — /x) it holds that the exponentially 



1 

W) 
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decaying factor fi n <; e _M is still of order 1. In this range bounds of the form (1) only yield 
a trivial statement. 

As compared to Theorem III. 2 and Theorem III. 3 the bound in Theorem IV. 3 depends 
more explicitly on the spectral properties of T — Too- The Jordan block structure of 
T — Too is reflected by the fact that the formula contains a certain factor for each factor 
of m-f-Too- in contrast to Theorems III. 2, III. 3, Theorem IV. 3 clarifies in which way the 
Jordan structure of T — Too influences the speed of convergence of a Markov process. 

The upper bound in Theorem IV. 3 can be made more explicit by taking the supre- 
mum over all factors in the Blaschke product individually. It is not difficult to see (see 
Appendix A) that for |A| < fi(l + 1/n) < 1 one has 



This leads to the following corollary: 

Corollary IV. 4. Let a(T — Tx>) = {X±, Xd} be the spectrum of T — To so that the 
magnitudes are ordered non-decreasingly and let fj, = \Xd\ be the spectral radius of T —Too- 
Under the assumptions of Theorem IV. 3 it holds that 



where the product is taken over alii such that the corresponding linear factor (z—Xi) occurs 
in a prime factorization of raj (z — Xd), respecting multiplicities and Ad stands for any 
eigenvalue of magnitude [i. 

Every eigenvalue of magnitude [i contributes one factor proportional to n//i in Equa- 
tion (13). Whereas Theorem IV. 3 contains an inverse Blaschke factor for each linear factor 
in the minimal polynomial m, in Corollary IV. 4 we have canceled one of the factors corre- 
sponding to the spectral radius /i by the (i/n prefactor in Theorem IV. 3. 

A detailed discussion of Theorem IV. 3 and Corollary IV. 4 follows in Section IV C. Here, 
let us just mention some situations in which the above bound might be applied. 

1. When T is the transition matrix of a classical time- homogenous Markov chain, Theo- 
rem IV. 3 can be used to estimate the distance of T n to stationarity. For the classical 
1-to-l norm it holds that fT™!-^! = 1 for any stochastic matrix and any natural 
number n, such that Theorem IV. 3 applies with (7 = 1. 

2. For all T £ 1+ and any n we have that ||T n || = 1. Thus, Theorem IV. 3 provides a 
convergence bound for quantum Markov chains with C = 1. 

3. Theorem IV. 3 holds for general power bounded operators (in finite dimensions) 
whose spectrum is contained in the unit disc. Therefore our result applies to cone- 
and base-preserving maps with the corresponding norms, more general than transi- 
tion matrices of classical Markov chains and TPPMs. An important class of such 
operations constitute LOCC maps [18]. 



sup 

|z|=/i(l+l/n) 



l-Xz _ 1 - (1 + l/n)n\\\ 
z — X fi — \X\ + fi/n 



(13) 




m/(z-\ D ) 



n 
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4. In the context of classical and quantum Markov chains one is often interested in the 
quantity \T n (v) — 7^(v)|li> where, depending on the context, v is either a probability 
vector or a quantum state. If v is contained in an invariant subspace Vi nv of T it is 
clear that one can improve the bound in Theorem IV. 3. We then have that 

1 



n 



(l-(l + i) M ) J/ " k|=Ml+l/«) 



B(z) 



< 



4e 2 v^(H + lK j-j 1 - (1 + £) M |Ai| 



where now 5 = -E>(T-Toc) lIlt , i s the Blaschke product corresponding to the minimal 
polynomial m = rn^ T _ Toa ). nv of T - Too restricted to Vj nv . 

For the proof we present an upper bound on ||.4 n || for a general power bounded operator 
A, whose spectrum is contained in B and we specialize to the case A = T — %o only at 
the end. More precisely, we start with any A £ £(V) whose spectrum is contained in the 
open unit disc and suppose that \\A n \\ < C for all n £ N. For convenience we assume that 
the eigenvalues {Ai, A^} of A are ordered with non-decreasing magnitude. 

In what follows we shall assume that the map A is diagonalizable, i.e., its minimal 
polynomial decomposes into pairwise distinct linear factors. This assumption does not lead 
to any difficulties when it comes to finding upper bounds of the type of Theorem IV. 3. 
To see this, assume that, for each fixed n. Theorem IV. 3 holds true for any A such that 
the minimal polynomial mjs, decomposes into pairwise distinct linear factors. To pass to 
the case when A has non-trivial Jordan structure one slightly perturbs the spectrum of A 
and obtains a diagonalizable map A + e. Note that for sufficiently small e the spectrum of 
A + e still is contained in the open unit disc, such that A + e is power bounded with some 
constant C e . In the limit of e — > 0, C e converges to C [14]. Thus, for each fixed n one can 
apply the theorem for diagonalizable matrices and pass to the limit e — > on both sides 
of Theorem IV. 3. By continuity of the norm this implies the claimed statement. 

Proof of Theorem IV. 3. We adapt techniques developed in [14] for general power bounded 
operators (see Theorem 3.20) and invoke Lemma IV. 1 to transfer the problem of estimating 
||„4 n || to the one of bounding \z n \ w / m w ^ follows from the definition of the function 
algebra, that [14] 

IHw/mW = ™f{\\g\\w \9ZW, g(X t ) = A™}. (14) 

This means that the problem of bounding \\z n \\ w / mW is equivalent to finding a minimal 
norm function g that interpolates the data set (Ai, A"), (A| m |, AjJ.) in the sense that 
g(\i) = A™. (More generally, the problem of bounding a function / of a quantum channel 
is related to an interpolation problem in the Wiener algebra by replacing A" by /(Aj).) The 
strategy of our proof will be to consider one specific representative function g in (14) and 
bound its norm. To achieve this we employ the following method. Instead of considering 
g directly we choose a "smoothing parameter" r and pass to a "stretched" interpolation 
function. 
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Given any function / € H and r 6 (0,1), we write f r (z) := f(rz) = 2~^ fc>0 f(k)r z and 
observe that by the Cauchy-Schwarz Inequality 



I/. 



r\\W 



< 



fc>0 



< 



„2" 



(15) 



This idea was used to obtain bounds to the inverse and resolvent of a power bounded 
operator in [14]. 

We use the Blaschke products B(z) = 11^ *Zy z anc ^ B(z) = IT fz^i ' wriere in the latter 
product the spectrum is stretched by a factor of r. (The products are taken over all prime 
factors of m_4, but to avoid cumbersome notation we do not write this explicitly.) Consider 
now the function g with 



9(z) 



iA fc i 2 )n\^. 



g is analytic in the unit disc and g(X) = X n for all A G f(T). To be able to use the 
estimate (15) we perform the aforementioned smoothing. We define the modified function 

g b y 



B(z) 



1 r 1 - r 2 Xj\k 

n rAfc _ rXj 



and observe that g r enjoys the same basic properties as g, i.e., g r is analytic in D and 
g r (A) = X n for any A G <r(T). Thus, by Equation (14), we have that ||z ra || w / mW ^ IliMw 
and it follows from Inequality (15) that 



Ik/ 



r\\w 



< 



1 



By the Maximum Principle for analytic functions |<7||#oo is attained on the unit circle, 
that is IpIjjoo = sup| z | =1 |g( z )l- Exploiting the fact that each elementary Blaschke factor 
preserves the unit circle, we conclude that 



\\9\\ H ° 



sup 

Ul=i 



i 



r 2 \X k \ 2 



jjl- ^ 2 AjA fc 



To bound this quantity we perform a contour integration along the circle 7 : <fi 1— > se 1 ^, 
where s < 1 is chosen in a way such that 7 encircles all eigenvalues of A. By the Residue 
Theorem (note that \z\ = 1) we have that 



; z-rX k 



n 



1 - r 2 XjX k 
rX k - rXi 



1 r x n 1 

" 2vri X S r (A)^-rA 



dA. 



(16) 
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Integration by parts gives 

1 f A n 1 



2vri 7 7 B r (A)«-rA 
and we arrive at 



dA 



B n+1 



2m(n + 1) 



v n+l 



1 



£ r (A)(z-rA)J 



dA 



(17) 



il^oo < —, — T SUP / 

27r(n+ 1) | z |=i7 7 



B r (A)(z-rA). 



|dA|. 



The right hand integral can be interpreted as the arc length of the image of 7 under the 
rational function - —7 — . For this quantity we have by Spijker's Lemma ([24], Equation 

(4)) 



_B r {\){z-r\)_ 



|dA| < 2ir{\m\ + 1) sup 
|A|=» 



B r (X)(z -rX) 



and conclude that for < r < 1 and \i < s < 1 we have 

11-11 ^ n+l(H+ 1 ) 1 

5 hoc < s + - — — — sup 

(n + l l-rs| A | =s 



n 1 - V 2 A 
11 r \ _ 



rA — r\i 



In the above bound we choose s = (l + l/n)fx (where fx denotes the spectral radius of A) 
and notice that 



s n+l = ( 1 + 



1 



n 



n+l 



< e(l + l/n)/i' 



n+l 



which entails 



boo < 



/ u n+1 (|m| + l)e 
nr H(l - r (l + l/n)/x) T^l= 

(l + l/n) M 



sup 



II \ w 



A - A,; 



and 



I, nn . / _L~ /x w+1 (M + l)e 

i* < y ! _ r2 nrH(1 _ r(1 + 1/n)/i) sup 



Finally, we observe that 

1 - A,r 2 A 



sup 

|A| = 
(l + l/n) M 



n 



A - A, 



sup 

(l+l/n) M 



< sup 

|A| = 

(l+l/n) M 



1 



B(A) 
1 



B(A) 



(l+l/n) M 



n 



1 - A,r 2 A 



A -A, 



n 



1+ 



A,A(1 - r 2 ) 



1 + 



1 " A,;A 



1-r* 



l-/i(l + l/n) 



|m| 
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We can choose 1 — r 2 = 1 ^^^^ and get 



2e 2 // i+1 v / H(H + 1) 

Ww/mW < n(1 _ (1 + 1/n)M) 3/2 ™P 

(l + l/n) M 



B(A) 



where we used the fact that (1 + l/|m|)' m ' < e and that, by the Bernoulli inequality for 
|m| > 1, M > (1 - 1 ~^ 2 1+ ^ ) > 1/2. 

We now specialize the above derivation to the case when A = T — Too- By assumption it 
holds for any n and Te T that ||T n || < C and it follows that 



ICr-Toof! 

In total we can assert that 

\\T n -T£\\ =\<T-% r\ < 



\\T n -T£\\ <mi + rai <2c. 



4Ce 2 \m T - T J 1/2 (\mr^rJ + 1) ■ V n+1 



n 



sup 

|A| = 
(l + l/n) M 



5(A) 



completing the proof of Theorem IV. 3. 



□ 



C. Comparison to the Schur and Jordan convergence bounds 



Theorem IV. 3 significantly improves upon both the Jordan and the Schur bounds, 
Theorems III. 3, III. 2. In this subsection we shall illustrate this comparing the different 
convergence estimates for a semigroup of quantum channels. Since for all T £ T+ we have 
that ||T|| = 1, Theorem IV. 3 gives a bound for the diamond norm. With the notation of 
Theorem IV. 3 we have that 



< 



n(l-(l + £)M) 
4e 2 ^/jm[(|m| + 1) • \i 

' (l"(l + ^) 3/2 



M (l+l/n) 



1 



n 

m/{z-X D ) 



B(z) 



(18) 
(19) 



For the inverse Blaschke product in (19) we can establish lower and upper bounds. The 
function ^~xy~j~~~ is monotonically increasing with iG [0,/i] and we have that 



,(l + l/n)/x 



H-i 



s n 

m/(z-\ D ) 



1-(1 + I) M |A < 
/i-IAil + S 



M — 



< r(l-^) 



/I 



H-i 



(20) 



In the following we compare Inequalities (18), (19) to the corresponding bounds resulting 
from the Jordan and Schur decompositions. 
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Comparison with the Jordan bound. To establish a convergence bound for 
quantum channels in diamond norm one can use Theorem III. 2 together with the norm 
equivalence relations (3), (4). But as Theorem III. 2 has a qualitative character only (i.e., 
it does not specify C±, C2), the constants coming from the norm equivalence are of no 
relevance. As expected, both Theorem III. 2 and Inequality (19) include an exponential 
factor ji n . Suppose that the largest Jordan block for Ad has size and that there is no 
other eigenvalue of T — Too of magnitude [i. Then the minimal polynomial of T — Too 
contains a factor {z — XdY^ an d in (19) there are cL — 1 factors for this eigenvalue. 
The denominator in Inequality (19) leads to a factor (n/fi) d ^~ 1 in this estimate, which 
captures the same qualitative n-dependence as the upper bound of Theorem III. 2. Due to 
the lower bound in Theorem III. 2 the factor [nj fi) d ' 1 ^ 1 is also necessary. But as compared 
to Theorem III. 2 Inequality (19) bears the obvious advantage that it specifies C2. On 
the other hand if there are several distinct eigenvalues of magnitude fi, Inequality (19) 
does not yield the correct asymptotic behavior from Theorem III. 2, since any eigenvalue 
of magnitude [i occurring in m contributes a factor n//j,. The reason for this lies in 
the estimate (18), i.e., in bounding each Blaschke factor individually, which leads to 
Corollary IV. 4. Roughly speaking, if there are distinct eigenvalues of magnitude fi then, 
for sufficiently large n, any z of magnitude //(l + 1/n) can be close at most to one of those 
eigenvalues. It is not difficult to make this intuition precise and prove the upper bound 
of Theorem III. 2 based on Theorem IV. 3 with the additional advantage of specifying 
C2. Finally we note that the occurrence of the correct asymptotic n-dependence in 
Theorem IV. 3 is linked to the integration by parts in (17) and our application of Spijker's 
Lemma. This procedure yields the 1 jn prefactor in Theorem IV. 3, which is canceled by one 
inverse Blaschke factor in Corollary IV. 4. Had we bounded (16) directly by the supremum 
of the integrand on the circle, we would have obtained an estimate where one factor in the 
Blaschke product is proportional to n//i even in case of only one eigenvalue of magnitude [X. 

Comparison with the Schur bound. Taking into account the norm equivalence 
relations (3), (4) the Schur bound entails 

||T" - 7£L < 2 J D 3 / 4 (/i + 2D 1 /4 ) D-i n D- V-ZW 

If one assumes that \d is D-fold degenerate with maximal Jordan block this results in a 
factor (n/ [i) D ~ l in Inequality (20) . Hence, even in the case of the worst Jordan structure for 
T — To-, Theorem IV. 3 improves upon bounds obtained from Theorem III. 3 exponentially 
in the D-dependend prefactor. 

Finally, we discuss some implications of the lower bound in (20). We use that bound 
to estimate how good the upper bound of Corollary IV. 4 can possibly be. Note that 
the left hand side of Inequality (20) contains a factor (l///)l m l -1 . If all eigenvalues of T 
are distinct this factor grows with the dimension of the system. That is, for "generic" 
T it needs D time steps until Corollary IV. 4 can yield a nontrivial statement. This is 
unfortunate from the point of view of applications, where one is looking for estimates such 
that poly(\og(D)) steps are sufficient. It is natural to ask whether or not Theorem IV. 3 
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is optimal and whether one might be able to dispense of the (l//x)l m l _1 prefactor. The 
following subsection discusses aspects related to the optimality of Theorem IV. 3. Even full 
information about the spectrum (alone) is never sufficient to prove poly (log D) convergence. 
To overcome this issue one may use properties of the semigroup beyond its spectrum. One 
important class of semigroups for which fast convergence can be proved under additional 
assumptions are detailed balanced semigroups (Definition V.l). We discuss the convergence 
of such semigroups in detail in Section V. 

D. Semigroups of Hilbert space contractions 

In this subsection we discuss semigroups of Hilbert space contractions. More precisely, 
suppose we are given a semigroup (T n ) n >o of linear operators acting on a finite-dimensional 
Hilbert space such that fTH^ < 1. As before, our major interest lies in bounding the 
quantity \\T n — 7^1^ in terms of the spectrum of T ■ Clearly, this setup is less general 
than our main setup in Section IV B and one can expect better bounds. In what follows 
we derive an analog of Theorem IV. 3 for contractive semigroups and discuss the optimality 
of the obtained bounds. 

Let us adopt the notation from Theorem IV. 3. As before we write a(T — To) for the 
spectrum and m = rn-j--j 00 f° r the minimal polynomial of T — To- B(z) = Y[i 
denotes the Blaschke product associated with m. To avoid cumbersome notation we shall 
again assume that m has simple zeros. The extension to the more general case does not 
result in any difficulties. Before we proceed with our main discussion we briefly introduce 
some notation and standard concepts from spectral operator theory. We define the \m\- 
dimensional model space 

K B := H 2 Q BH 2 := H 2 n (BH 2 )^, 

where we employ the usual scalar product from the Hilbert space H 2 . The model operator 
Mb acts on Kb as 

M B : K b ^ K b 

f^M B (f) = P B (zf), 

where Pg denotes the orthogonal projection on K B - In other words, Mb is the compression 
of the multiplication operation by z to the model space Kb (see [13] for a detailed discussion 
of model operators and spaces). As multiplication by z has operator norm 1 it is clear that 
Mb is a Hilbert space contraction. More precisely, for any <fi £ H°° the norm of (p(M B ) 
can be evaluated using Sarason's lifting Theorem [15, 20] as 

U(Mb)\\oo = U\\H°°/mH~ ■ (21) 

We can also write ^(Mb)^ as variational expression in the Hardy space H 1 . From [5] 
we get that 

1^)1^= sup 

FeH 1 

ll-Flll^ 1 



1 



(22) 
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Note that this trivially implies 

1 



2vri ./ N=1 B 



dz 



< IWB)^ < SUp 
1*1=1 



B 



It can be shown that the spectrum of the model operator Mb defined above is given by the 
zeros of the corresponding Blaschke product B. In our case this means that T — Too an d 
Mb have identical spectrum. Hence, to any T we can associate a (completely non-unitary 
[11]) contraction Mb having spectrum a = a(T — Too)- 

Let us proceed by studying convergence estimates for the contractive semigroup of the 
form of Inequality (1). To start with, we prove that if fTf^ < 1 then \\T — Tooioo < L i-e. 
the semigroup {(T — Too) n }n>o is contractive, too. 

Proposition IV. 5. Let (T n ) n >o be a contractive semigroup on a Hilbert space and let Too 
be as in Equation (8). Then 

(i) the semigroup {{T — Too) n }n>o is contractive, and 

(ii) ifT*(e) = Xe with |A| = 1, then T(e) = Ae. 

Proof. Both follows from the fact that any contraction on a Hilbert space admits a unique 
decomposition into an orthogonal direct sum of a unitary and a completely non-unitary 
operation ([11], Theorem 3.2). In our case, Too corresponds exactly to the unitary part of T 
and T —Too is a (completely non-unitary) contraction, hence (i). (ii) is then a consequence 
of the normality of the unitary part. □ 

The second part of Proposition IV. 5 generalizes the fact that for classical as well as 
for quantum Markov processes, contractivity implies that the transition map is doubly 
stochastic. In fact, in those cases the converse implication holds as well [17]. 

From the first part of Proposition IV. 5 and by Inequality (12) it follows that 

\\T n -T2\\ 00 = \\(T-T 00 ) n \\ 00 <\\z n \\ H00 . 

Our previous considerations from Section IV A, Lemma IV. 1 furthermore imply 

II T" 71 ~~ Toolloo — II^Ih 00 /mH°° ■ (23) 

We conclude from Equation (21) that in order to upper bound (23) it is sufficient to 
consider || Mg \\ . This is in contrast to our discussion of bounded semigroups on Banach 
spaces, where we had to rely on the Cauchy-Schwarz Inequality (15). In addition, we note 
that \<t>\ HOO / mH c=o = | < / ) (-^b)|| 00 allows us to work with \\-\\ HOO / mHOO directly and we do not 
require an ad hoc function to upper bound (14). 

In our study of bounded semigroups in Section IV C we have encountered a factor 
(l/ M )M-i in (20) that grows exponentially with the dimension of the space on which the 
semigroup acts if all eigenvalues of the generator are distinct. The following proposition 
shows that, if in a bound of the type (1) K only depends on the eigenvalue structure of 
T and on n, then K must contain such a factor. We achieve this by showing that for any 
contractive semigroup with generator T there is a contractive semigroup whose generator 
has the same spectrum as T but which converges slowly if n is small. 
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Proposition IV. 6. Let (T n ) n >o be a contractive semigroup acting on a D -dimensional 
Hilbert space and let m = m-y-Toc denote the minimal polynomial of T — To and B the 
corresponding Blaschke product. Then there is a contractive semigroup (£ n ) n >o such that 
£ has the same minimal polynomial as T and 



In particular, for all n < \m\ < D it holds that 



The supremum in Proposition IV. 6 is attained by a function F = f 2 , where / is in 
the unit ball of Kb [5]. Hence, the optimization effectively goes over a finite-dimensional 
vector space of rational functions with fixed poles and bounded degree (see [5] for details) . 
One can obtain simple lower bounds on the convergence speed of (£ n ) n >o by choosing a 
certain / 6 Kb and evaluating the integral with the Residue theorem. 

The second assertion of Proposition IV. 6 states that for any spectrum we can construct a 
semigroup such that the distance of the evolution to its asymptotic behavior stays maximal 
for at least \m\ — 1 time steps. Clearly this implies that one cannot prove that poly(log \m\) 
time steps bring the semigroup close to its stationary behavior if only spectral data is given. 

Note that if in a bound of the form \T n — T£>\ < Kfi n , with a bounded semigroup 
{T n )n>o, K only depends on the spectrum of T then by Proposition IV. 6 we have 1 < 
Kfi \m\-i m T hat is, in this case we obtain the lower bound K > (l/fiy™ 1 ' . 

Proof of Proposition IV. 6. The first assertion is clear by choosing "<? := Too © Mb" such 
that £oo = Too (on the unitary subspace) and £ — £oo = 0®Mb- For the second we consider 
the extremal problem Equation (22). Let ip be any rational function with poles away from 
the unit circle \z\ = 1. Corollary 5 in [5] asserts that, we have 



if and only if ip is a constant multiple of the quotient of two finite Blaschke products B±,B2 
having no common zeros and such that the degree of B\ is strictly smaller than the degree 
of B2 (\Bi\ < I-B2I), i.e., ip = Cj^ for some c G C. Let B denote the Blaschke product 
associated with m, it follows readily that 






holds for n < \m\. 



□ 
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To gain a better understanding of weather the derivation of Theorem IV. 3 is optimal, 
i.e. whether or not the obtained estimate is sharp, let us prove an analog of Theorem IV. 3 
for semigroups of Hilbert space contractions. The derivation is based on techniques similar 
to those that led to Theorem IV. 3, but in the case at hand we can take a more direct 
approach based on the theory of model operators. 

Proposition IV. 7. Let (T n ) n >o be a contractive semigroup on a D -dimensional Hilbert 
space and let To be the operator introduced in (8) (i.e., the unitary part ofT). We write 
m = m-T~Too f or ^ e minimal polynomial and fx for the spectral radius of T — Too ■ B 
denotes the Blaschke product associated with m. Then, for n > j^— we have 



Iir*-7^|| <m 



n+1 



2\m\e 



n(l - (1 + l/n) 2 // 2> 



sup 

\z\=ix(l+l/n) 



B{z) 



As before, we can bound all terms in the Blaschke product individually (see Appendix A) 
and find (compare Corollary IV. 4) 



\\T n -TZ\L<^ 



2\m\e 



1 _ (i + i/ n )2 2 II ^(i + i/„) _ Ai ' 

i^\m\ 



n 



Proof of Proposition IV. 7. The derivation proceeds along the lines of Theorem IV. 3. We 
use an H°° functional calculus to bound \T n — 7^1^ in terms of \z n \ Hao i mHOO . The latter 
expression can be rewritten using a contour integral similar to Equation (16), integrate by 
parts, and finally apply Spijker's Lemma. We have already mentioned that 



\\T n 



T n II < II ? n II 
'oolloo — II ^ W H°° /mH° 



= II AfSII 

II tswoo 



sup 

FGH 1 



1 



l*l=i 



B 



-Fdz 



and that the supremum in this extremal problem is attained by some function F = f 2 
with / G Kb [5]. Thus, F/B is a rational function with 2\m\ poles located at 
(£i> •■•) £|m|> Cf 1 ' ■■■■■> Cu^i)) where £j are the zeros of m. In the above integral we can change 



>|m| 

the contour of integration and integrate along the circle 7 : <; 
by parts and and applying Spijker's Lemma [24] we obtain 



/x(l + l/n)e^. Integrating 



1 

2ld 



B 



Fdz 



1 



2vr(n + 1) 



< 



< 



/i" +1 (l + l/n) n+1 
2vr(n + 1) j- 

2\m\n n+1 {l + l/n) n+1 
n + 1 




sup 

\z\=fi{l+l/n) 
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It is known that for F G H 1 and z£D one can bound |F(,z)| < jzyjp 11-^1 ffi [8] an d with 
(1 + l/n) n < e we finally obtain 



2\m\eu n+1 



n{\ - (1 + l/n)V) 



sup 

|*|=,i(l+l/n) 



E. Slow convergence for Markov chains 



□ 



Proposition IV. 6 provides an example of a slowly converging contractive semigroup with 
arbitrarly given spectrum. One might wonder in how far the phenomenon extends to the 
Markov chain setup. When T is the transition map of a classical or quantum Markov 
chain, is it possible to prove (1) where, K should only depend on the spectrum of T and n 
but such that the stationary behavior sets in after poly(\og{D)) time steps? The following 
example shows that this can not be the case. 

We construct a classical stochastic D x D matrix T with real positive spectrum such 



that \\T r - 



-oo || 



2 for n < D — 2. Let, as always, ji denote the spectral radius of 



T — Tqq. We write {ei}i=\,...,D for the canonical column vectors, i.e., (ej)j 
Aj G [0, 1), 1 < i < D - 1, we define 



Sij and for 



/ A! 

1-Ai 



V 



A 2 

1 — A2 A3 



Ad-i 
1 - A D -i 



V 



T is a stochastic matrix with spectrum <r(T) = {Ai, \d-i, !}• Since Aj < 1 for large n 
the image of T n converges to an one-dimensional subspace corresponding to the eigenvalue 
1. We have that = lim n ^ 00 T n and observe that T^ei = en- It is not difficult to 
see that for n < D — 2 the D-th entry of the vector T n e\ is always zero, (e£>|T n ei) = 0. 
It follows that \{T n — T 00 )e\\ l = 2 (where f-^ denotes the 1-norm, Section II B) and we 
conclude that \\T n - T^W^ = 2 for n < D 



2. As before, if K only depends on the 
spectrum of T this implies that K > (l/u) ~ 2 . Note that the above reasoning does 
not depend on the exact values of the eigenvalues (as long as they are non- negative) . 
This suggests that generally the spectrum <r(T) does not contain sufficient information to 
prove poly(log D) fast convergence estimates. Since every classical stochastic matrix can 
be embedded into a quantum channel, the lower bound on K is also true for quantum 
channels. 



V. CONVERGENCE BOUNDS FROM DETAILED BALANCE 

Applications often rely on fast convergence in the sense that poly (log D) steps should 
suffice for the asymptotic behavior to set in. In our previous discussion we have argued 
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that such bounds cannot rely on spectral data alone. To obtain better convergence es- 
timates one requires additional knowledge about the semigroup. In this section we will 
derive convergence estimates for a general bounded semigroup under the condition that 
its generator be related to a Hermitian map in a certain way - for classical and quantum 
Markov processes this will correspond to the well-known detailed balance condition (see, 
e.g., [1, 7, 26]). Throughout this section we require the state space V to be equipped with a 
scalar product (-|-), which induces norms ||-|| 2 and ||-|| on V and C(V), respectively, and for 
convenience we will sometimes assume an orthonormal basis in V to be fixed (cf. Subsection 
II A). 



A. General bound 

We start with a generalization of the well-known detailed balance condition for classical 
Markov chains. This allows us to employ the corresponding property in the context of 
bounded semigroups. 

Definition V.l (Detailed balance for linear maps). Let a linear map T € £(V) be given. 
If Be £(V) is positive-definite (i.e., (v\B{v)) > Vt> € V \ {0} ) and satisfies TB = BT* , 
then we say that T satisfies the detailed balanced condition (with respect to B). 

This definition is equivalent to saying that T is Hermitian with respect to some scalar 
product on the space V, namely the scalar product (■\B~ 1 (-)}, but we choose the formulation 
with given scalar product (-|-) (independent of T) and explicit use of B. Note further that, 
due to strict positive-definiteness, B in the above definition is in particular Hermitian and 
invertible. In conventional formulations of the detailed balance condition the map B is not 
required to be strictly positive-definite, but we do so here as the derived bounds become 
trivial otherwise (see below). 

The detailed balance condition for a linear map T gives 

which means that B~ l l 2 TB 1 l 2 is Hermitian, therefore has only real eigenvalues Aj G R 
(i = 1, . . . , D), and is unitarily diagonalizable: 

Ai \ 

WB'^TB^U = A = 

v W 

This equation implies that T is diagonalized by the similarity transformation S := B^I^IA 
(i.e. S~ X TS = A). Note that T has spectrum {Aj}j, too. 

If T is now power-bounded, i.e., the generator of a bounded semigroup, the definition 
in Equation (8) implies that Too is diagonalized by S as well, 



U*B~ l l 2 TooB 1/2 U = A 



OO) 
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where A^ is obtained from A by deleting all entries of magnitude smaller than 1. A — Aoo 
is thus diagonal with operator norm \i < 1 , where jjl is the spectral radius of T — %o ■ We 
thus arrive at the following convergence estimate: 



KT-Toc 



< 



B l/2 



IMIoolKA-A 



ooy Hoc 



£-1/2 



£1/2 



(The latter two factors may be recognized as the condition number of B 1 / 2 .) We formulate 
this as a theorem: 

Theorem V.l. Let V be a (real or complex) vector space with scalar product, and T G C(V) 
be the generator of a bounded semigroup (T n ) n >o, which satisfies detailed balanced w.r.t. a 
positive-definite B G C(V). Denote by fj, the spectral radius of T — Too- Then, for any 
n G N, 



where 



denotes the operator norm on C(V) 



B l/2 £-1/2 



We now discuss detailed balance more specifically for classical and quantum Markov 
chains. First observe that, if e G V is a fixed point of T*, i.e. T*(e) = e, then 7r := B(e) 
satisfies 



T(tt) = TB(e) = BT*{e) = B{e) = tt , 

i.e., 7r is fixed by the semigroup generator T ■ Conversely, if tt is a fixed point of T, then 
e := S _1 (7r) is left invariant by T* ■ For a classical Markov chain the generator satisfies 
T*(e) = e with e = X^£i e « = (1> ■•■> 1) an< ^ = 1 holds for generators of quantum 

Markov chains (see Section II B). Thus, for classical and quantum Markov chains the 
detailed balance condition immediately yields a fixed point of the transition map. 

In the theory of classical Markov chains, a stochastic matrix T G M. dxd is usually 
defined to be detailed balanced w.r.t. the probability distribution it G W 1 (i.e. 7Tj > and 
= 1)' ^ TjiiTi = Tijiij holds for all i,j (see, e.g., [1, 7]). Defining a diagonal matrix 
B with entries Ba := m, the latter condition can be written as TB = BT*. If furthermore 
the fixed-point probability distribution tt has full support (i.e. 7Tj > Vi), then T is detailed 
balanced w.r.t. B in the sense of our Definition V.l. (tt = Be will necessarily be a fixed 
point of T.) Due to normalization it holds that minj7Tj < 1/d. Using this and the norm 
equivalence (2), Theorem V.l yields the following well-known convergence estimate [1, 7] 
for the special case of a classical Markov chain that satisfies detailed balance w.r.t. the 
distribution tt: 

W^-TZL, < fjrVdJ^^- < -J?—. (24) 

y mmj TTi mm; tti 



27 



This estimate may become trivial if detailed balance is denned without the full-support 
condition on ir as one may then have minj 7Tj = 0. On the other hand, if one has a positive 
lower bound on minj7Tj, Equation (24) may become a useful convergence estimate. This 
technique is frequently used for detailed balanced chains that have a (unique) full-rank 
probability distribution as fixed point, and where one can find a "good" lower bound on 
minj 7Tj [1, 2, 7]. Often the situation arises that the chain converges to a Gibbs state 
7Tj = e~^ Hi /Z at finite inverse temperature j3 G [0, oo) with Z := e~@ Hi . An important 
class of Markov chains that obey the detailed balance condition are Metropolis Hastings 
Markov Chains [7]. 

There are different generalizations of the detailed balance condition to quantum Markov 
chains [26], which we, however, all capture by Definition V.l. Let us specialize to the quan- 
tum detailed balance condition that most immediately generalizes the classical condition 
from the previous paragraph to the non-commutative case in a symmetric way and that 
has been employed for proving convergence of quantum Markov chains before (e.g. [27]). 
Namely, given a positive trace-preserving map T G T acting on the set Aid of d x d ma- 
trices, we consider the detailed balance condition induced by the map B a (X) := ^foX^fo, 
where a G Aid is a density matrix of full rank. Again, due to trace-preservation, it is easy 
to see that if T is detailed balanced w.r.t. B a , then a = B a (t) is a fixed point of T . This 
leads to the following convergence result for the quantum case: 

Corollary V.2. Let T : Aid — > Aid be a positive trace-preserving map, and a G Aid be a 
full-rank density matrix such that 

V^TpOV^ = T(y/oXy/Z) MX G Ai d . 
Denote by \x the spectral radius ofT — Too- Then, for any n £ N, 



V Amin{0-) 



^rram( cr ) 



where \ m i n (a) and A maa; (a') denote the minimal and maximal eigenvalues of a, respectively. 
If, in addition, a = e~^ H /tr(e _/3 ^) is the Gibbs state at inverse temperature (3 G [0, oo) of 
a bounded Hamiltonian H G Aid, then 

WV'-TZh-! < v n de 2 » H ^ . 
Proof of Corollary V.2. The conditions on T imply that it is detailed balanced w.r.t. the 



map B a defined above. Computing 



B„ 



1/2 



= \JKmx{o) and B a 1/2 = 1/ ^\ min {o-) 

oo 

and considering the norm equivalence (4) and bounding \ m in(o~) < 1 / <i and A max (a) < 1, 
we get the first assertion from Theorem V.l. In case of a thermal state, the second assertion 
follows from 



; e -m \ e -/3|//iu e-mm 



tr(e-P H )J ~ tr(e^loci d ) d 

□ 
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This Corollary provides a possible way for proving that a state preparation or algorithm 
is efficient in the sense of computational complexity [12]. More concretely, for each N, 
consider a system of N particles (spins), each with finite Hilbert space dimension s < oo, 
and a Hamiltonian Hjy on each system. In many physical situations the Hamiltonian will 
be bounded by some polynomial of the particle number, liJjvloo < cnN k ; this occurs for 
example if Hjy = J2i^N,i is a sum of fe-local terms that are uniformly bounded by ch- 
Assume further that the thermal state ct/v = e~@ HN /tr(e~P HN ) at inverse temperature 
/3 G [0, oo) is a fixed point of the positive trace-preserving map 7at, and that T/v satisfies 
detailed balanced w.r.t. B aN . This assumption may be fulfilled, e.g., by Gibbs dynamics in 
a Markov Chain Monte Carlo Algorithm [27]. Lastly, assume that the spectral gap of T/v is 
asymptotically lower bounded by an inverse polynomial c^/N a of N (where c M > 0), i.e., 
the eigenvalue 1 corresponding to a n is the only eigenvalue of Tn with modulus 1 whereas 
I Ai| < 1 — c^/N a for all other eigenvalues. Among these assumptions, when they apply, 
the latter one is usually the hardest one to prove in a given situation. 

Under these presuppositions, the evolution operator T/v prepares the final state ct/v 
efficiently in the system size N . More precisely, for any initial state pn of the system, the 
time-evolved state T^(p/v) after n steps will be e-close in trace-norm to the thermal state 
a N (i.e. \7%(pn) ~ oaHIi - e ) if 



This means that the runtime to e-convergence scales at most polynomially in the particle 
number N and polylogarithmically in the desired accuracy e G (0, 1], which proves efficient 
state preparation. 

For a proof of the runtime bound Inequality (25), note that the dimension of the N- 
partite system is djy = s N and that, due to the spectral gap condition, (Tn)oo{pn) = &N 
for any state pN, which implies \\7]}(pn) — caHIi — l^N ~ (TaOooIIi-i- Finally, we use 



in the Gibbs state bound from Corollary V.2 and requiring the latter to be at most e shows 
that the condition in Inequality (25) is sufficient for e- convergence. 

If one wants to bound the diamond norm ||T n — T2llo between the actual and the 
asymptotic evolution in Corollary V.2 instead of the trace-norm, then by Inequality (3) 
one incurs another factor d (or 1/A m , n (a)) in the upper bounds. This however does not 
affect the efficiency statement just obtained, as the asymptotic dynamics (Tjv)oo is still 
reached, up to e, in polynomial time. 



In this Subsection, again based on the detailed balance condition, we derive a sharper 
convergence bound than in Subsection VA, taking into account all eigenvalues and eigen- 
vectors of the transition map T. The special case of this bound for classical Markov 
processes has been used to prove so-called cutoff dynamics [1, 2, 7]. After describing the 




(25) 



B. An £ 2 bound 
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approach for general bounded semigroups obeying detailed balance, we will specialize to 
quantum Markov chains. 

Recall from above that, if T € £(V) is detailed balanced w.r.t. B, its eigenvalues Aj 
are real. Furthermore, as 1Z := B^^TB 1 ! 2 is a Hermitian operator, it has a complete 
orthonormal eigenbasis i.e. lZ{xi) = AjXj. From this we can define an eigensystem 

of the adjoint T*, which will play a prominent role in the bound: 

Vi ■= B~ 1/2 (xi), which implies T*(yi) = \yi ■ 

{Ui}i could alternatively be chosen as any eigensystem of T* that is orthonormal w.r.t. the 
weighted scalar product 

The spectral decomposition TZ(v) = ^ Xi(xi\v)xi now gives: 

B- 1/2 T n {v) = lZ n B- 1/2 {v) 

D 

i=i 

D D 

i=i i=i 

Recognizing that £> 1//2 (xj) is the right-eigenvector of T corresponding to y i: the terms with 
| Aj| = 1 in the last expression (which we assume to be i = r + 1, . . . , n) correspond to the 
asymptotic evolution 7^. We can thus write 

r 

B -l/2 {T n_ T n ){v) = Y j K{y t \v) Xl , 

i=l 

which, together with the fact that {xj} is an orthonormal system, gives by squaring: 

\\{T a -TZ){v)\l B -,:= ((T n -TZKv)\B~Hr n -T^(v))= jz^^v)? ■ (26) 

i=i 

This equality relates the eigensystem corresponding to the eigenvalues with modulus 
smaller than 1 to the convergence in a suitably modified Hilbert norm. By itself this 
relation does not seem very useful, although one can derive Theorem V.l from it by rescal- 
ing the modified scalar product back to the originally given one. 

When specializing to the quantum case, however, we can make a connection to the 
induced trace-norm, and thereby strengthen Corollary V.2: 

Proposition V.3. Let T : Aid — > Aid be a positive trace-preserving map, and o € Aid be 
a full-rank density matrix (i.e. tr(cr) = 1, a > 0) such that the detailed balance condition 

v^r*(X)V^ = T{^X^) VX£M d 

holds. Let {Aj}£ =1 be the part of the spectrum of T in the open interval (—1,1), and Y{ 
be the corresponding eigenvectors of the adjoint map T* , orthonormal in the sense that 
tv(Y*a l ^ 2 Yja 1 / 2 ) = 5ij. Then, for every Z G Aid (e.g. a quantum state): 

\{T-1Z){Z)\l < J>(17Z)| 2 A?». (27) 
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Proof. One can apply the preceding general steps to the map B a (X) := ^foX^fo and the 
inner product (Y\X) := tr(V*X). Then it remains to show that, for A := (T n - T£)(Z), 

\\A\\l < (A^iA)) = ti{A*a- l l 2 Aa' l l 2 ) . 

To see this inequality holds in fact for all A E A4^, use the polar decomposition and let 
U £ Aid be a unitary such that UA is positive-semidefinite. Then cyclicity of the trace 
and two applications of the Cauchy-Schwarz inequality give: 

1= \tr[UA]\ 2 = tr [(^4^1/4)^-1/4^-1/4) 



< tr 
= tr 

< 



a^Ua^Wa 1 ' 4 ] tr la'^A^-^Aa' 1 / 4 



Uo- l / 2 U*o- 1 ' 2 



tr 



A* a- 1 ' 2 Ac-- 1 ' 2 



^/tr [UaU*] tr [a^UU*^/ 2 ] tr \a* o~ x l 2 Ao' 1 ' 2 



= tr(cr)tr 



A*a- 1 l 2 Aa- 1 ' 2 



= tr 



A*a- 1 / 2 Aa- 1 / 2 



□ 

Detailed balance of a quantum map T w.r.t. certain other maps (B = f^) 1 has been 
defined in [26] so that the family includes the map B a from above. These detailed 

balance conditions also result in bounds that look essentially like Equation (27), except that 
in this more general case the Yi should be orthonormal in the sense that tr(Y*B(Yj)) = 5ij. 
For a proof, note that Equation (26) holds generally, and the proof of Lemma 5 in [26] 
shows ||(T n - T^)(Z)\\l < ||(T n - 7S)(v)|2 jB -i (the right-hand-side of the last inequality 
is a x 2 -divergence as considered in [26] only if T has merely one eigenvalue of modulus 1, 
however) . 

For classical detailed balanced Markov chains the analog of the convergence bound 
Inequality (27), which looks very similar in this setting [7], is often used for demonstrating 
the upper bound in cutoff results (cf. [1, 2] for an over overview and references). In 
this setting, most commonly the evolution T leads to a unique fixed point a (often the 
maximally mixed state), so that the asymptotic evolution would simply be the "projection 
onto the fixed point", i.e. T^(X) = atr(X) for n > 1. Of course, for Proposition V.3 to 
be useful one also needs knowledge about the normalized eigenvectors Y{. 
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Proof. We rewrite the absolute value on the left hand side using the fact that \a\ 2 = aa 
for all a G C. This gives 



Appendix A: An upper bound on a single Blaschke factor 



For completeness we prove the following short lemma. 



Lemma A.l. Let |A| < c < 1 then 



sup 

\z\=c 



1 - Xz _ 1 - |A|c 
z - X ~ c-\X\ ' 



1 - Xz 



2 



(1 - |A|c) 2 + 2|A|c-2^(Az) 
(c- |A|) 2 + 2|A|c-23?(A«) ' 



z — X 



Note now that for < f3 < a and < x we have 



a + x a 



Hence 



(l-|A|c) 2 + 2|A|c-2^(Az) (l-|A|c) 2 
(c-|A|) 2 + 2|A|c-2K(Az) " (c-|A|) 2 ' 



Finally, we note that the supremum is attained for z 



c 



A. 



□ 



